Approximate Query Answering In Numerical Databases

نویسندگان

  • Nabil I. Hachem
  • Chenye Bao
  • Steve Taylor
چکیده

Scienti c databases are usually large, distributed and dynamically changing. We address the problem of e cient processing of queries in scienti c databases, especially in very large numerical databases. Previous work has focused on how to store the database and the design of index structures for the e cient access of data. Recently more and more statistical methods have been used in query optimization. Those methods essentially attempt to approximate the distribution of the attribute values in order to estimate the selectivity of query results. We introduce a new methodology that uses regression techniques to approximate the actual attribute values. Through analysis of the data, one derives a set of characteristic functions to form a \regression database," a compressed image of the original database. Based on these functions, approximate answers to queries may be provided within a pre-speci ed tolerable error, but without the expensive search overhead usually inherent with the use of indexing techniques. We propose a framework to build regression databases. An experimental prototype is implemented to evaluate the technique in terms of realizability, e ciency and practicality. The results demonstrate that our approach is complementary to conventional approaches and to statistical methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximate Answers for XML Queries with Range Predicates

In this paper, we tackle the difficult problem of summarizing the path/branching structure and numerical value content of an XML database. We introduce a novel, powerful XML-summarization model, termed VTreeSketches, that enables accurate approximate answers for the class of twig queries with numerical-range predicates. In a nutshell, a VTreeSketch synopsis represents an effective clustering of...

متن کامل

Cooperative Query Answering for Approximate Answers with Nearness Measure in Hierarchical Structure Information Systems

COOPERATIVE QUERY ANSWERING FOR APPROXIMATE ANSWERS WITH NEARNESS MEASURE IN HIERARCHICAL STRUCTURE INFORMATION SYSTEMS Thanit Puthpongsiriporn, Ph.D. University of Pittsburgh Cooperative query answering for approximate answers has been utilized in various problem domains. Many challenges in manufacturing information retrieval, such as: classifying parts into families in group technology implem...

متن کامل

Providing Approximate Answers Using a Knowledge Abstraction Database

As database users adopt a query language to obtain information from a database, a more intelligent query answering system is increasingly needed. Relational databases are exact in nature, but effectiveness of decision support would improve significantly if the query answering system returns approximate answers rather than a null information response when there is no matching data available. Thi...

متن کامل

Accuracy and Efficiency of Fixpoint Methods for Approximate Query Answering in Locally Complete Databases

Standard databases convey Reiter’s closed-world assumption that an atom not in the database is false. This assumption is relaxed in locally closed databases that are sound but only partially complete about their domain. One of the consequences of the weakening of the closed-world assumption is that query answering in locally closed databases is undecidable. In this paper, we develop efficient a...

متن کامل

Approximate Query Answering in Locally Closed Databases

The Closed-World Assumption (CWA) on databases expresses that an atom not in the database is false. A more appropriate assumption for databases that are sound but partially incomplete, is the Local ClosedWorld Assumption (LCWA), which is a local form of the CWA, expressing that the database is complete in a certain area, called the ‘window of expertise’. Databases consisting of a standard datab...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996